Deals Among Rational Agents 


Jeffrey S. Rosenschein 
Michael R. Genesereth 

Computer Science Department 
Stanford University 
Stanford, California 94305 


Abstract 

A formal framework is presented that models commu¬ 
nication and promises in multi-agent interactions. This 
framework generalizes previous work on cooperation with¬ 
out communication, and shows the ability of communi¬ 
cation to resolve conflicts among agents having disparate 
goals. Using a deal-making mechanism, agents are able to 
coordinate and cooperate more easily than in the commu¬ 
nication-free model. In addition, there arc certain types 
of interactions where communication makes possible mu¬ 
tually beneficial activity that is otherwise impossible to 
coordinate. 

§1. Introduction 

1.1 The Muiti-Agent Paradigm and Al 

Research in artificial intelligence has focused for many 
years on the problem of a single intelligent agent. This 
agent, usually operating in a relatively static domain, was 
designed to plan, navigate, or solve problems under certain 
simplifying assumptions, most notable of which was the 
absence of other intelligent entities. 

The presence of multiple agents, however, is an un¬ 
avoidable condition of the real world. People must plan 
actions taking into account the potential actions of others, 
which might be a help or a hindrance to their own activ¬ 
ities. In order to reason about others' actions, a person 
must be able to model their beliefs and desires. 

The artificial intelligence community has only lately 
come to address the problems inherent in multi-agent ac¬ 
tivity. A community of researchers, working on distributed 
artificial intelligence (DAI), has arisen. Even as they have 
begun their work, however, these researchers have added 
on a new set of simplifying assumptions that severely re¬ 
strict the applicability of their results. 

1.2 Benevolent Agents 

Virtually all researchers in DAI have assumed that the 
agents in their domains have common or non-conflicting 
goals. Work has thus proceeded on the question of how 
these agents can best help one another in carrying out 
their common tasks [3, 4, 6, 7, 24], or how they can avoid 
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interference while using common resources [10, 11], Multi¬ 
ple agent interactions are studied so as to gain the benefits 
of increased system efficiency or increased capabilities. 

Of course, when there is no conflict, there is no need to 
study the wide range of interactions that can occur among 
intelligent agents. All agents are fundamentally assumed 
to be helping one another, and will trade data and hy¬ 
potheses as well as carry out tasks that are requested of 
them. We call this aspect of the paradigm the benevolent 
agent assumption. 

1.3 Interactions of a More General Nature 

In the real world, agents are not necessarily benevolent 
in their dealings with one another. Each agent has its 
own set of desires and goals, and will not necessarily help 
another agent with information or with actions. Of course, 
while conflict among agents exists, it is not total. There 
is often potential for compromise and mutually beneficial 
activity. Previous work in distributed artificial intelligence, 
bound to the benevolent agent assumption, has generally 
been incapable of handling these types of interactions. 

Intelligent agents capable of interacting even when 
their goals are not identical would have many uses. For 
example, autonomous land vehicles (ALV's), operating in 
a combat environment, can be expected to encounter both 
friend and foe. In the latter case there need not be total 
conflict, and in the former there need not be an identity of 
interests. Other domains in which general interactions are 
prevalent arc resource allocation and management tasks. 
An automated secretary [12], for example, may be required 
to coordinate a schedule with another automated (or hu¬ 
man) secretary, while properly representing the desires of 
its owner. The ability to negotiate, to compromise and 
promise, would be desirable in these types of encounters. 

Finally, even in situations where all agents in theory 
have a single goal, the complexity of interaction might be 
better handled by a framework that recognizes and resolves 
sub-goal conflict in a general manner. For example, robots 
involved in the construction of a space station arc fun¬ 
damentally motivated by the same goal; in the course of 
construction, however, there may be many minor conflicts 
caused by occurrences that cannot fully be predicted (e.g., 
fuel running low, drifting of objects in space). The build¬ 
ing agents, each with a different task, could then negotiate 
with one another and resolve conflict. 
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1.4 Game Theory's Model and Extensions 

In modeling the interaction of agents with potentially 
diverse goals, we borrow the simple construct of game the¬ 
ory, the payoff matrix. Consider the fqllowiug matrix: 


a 

b 


c 

d 

~W' 

2 

2\5 

0\1 


The first player is assumed to choose one of the two 
rows, while the second simultaneously picks one of the two 
columns. The row-column outcome determines the payoff 
to each; for example, if the first player picks row b and 
the second player picks column c, the first player receives 
a payoff of 2 while the second receives a payoff of 5. If 
the choice results in an identical payoff for both players, a 
single number appears in the square (e.g., the a\d payoff 
above is 2 for both players). Payolls designate utility to 
the players of a particular joint move [18]. 

Game theory addresses the issues of what moves a 
rational agent will make, given that other agents are also 
rational. We wish to remove the a priori assumption that 
other agents will necessarily be rational, while at the same 
time formalizing the concept of rationality in various ways. 

Our model in this paper allows communication among 
the agents in the interaction, and allows them to make 
binding promises to one another. The agents are assumed 
to be making their decisions based only on the current 
encounter (e.g., they won't intentionally choose a lower 
utility in the hope of gaining more utility later on). The 
formalism handles the case of agents with disparate goals 
as well as the case of agents with common goals. 

§2. Notation 

We expand on the notation developed in [8], For each 
game there is a set P of players and, for each player i G P, 
a set M, of possible moves for i. For S C P, we denote 
P - S by S, and write i instead of {t} (so T — P — {i}). 
We write Ms for ri/r.s- Mi- 

We denote by ms an element of Ms: this is a joint 
move for the players in 6\ To ms £ Ms and tn£ £ 
correspond an element m of Mp. The payofT function for 
a game is a function 


p : P x Mp -+ R 

whose value at (i,mj is the payoff for player i if move m 
is made. 

Each agent is able to specify a set ofjoint moves (i.e., 
elements of Mp) that specify outcomes the agent is willing 
to accept; this set is called an offer group. If any move 
or moves offered by one agent are likewise offered by all 


other agents, this set of moves constitutes the deal (i.e., the 
deal is the intersection of all the agents' offer groups). In 
practice, a single element of the deal set will be selected by 
a fair arbiter, and the result of the selection communicated 
to all agents. At that point, the agents are all compelled 
to carry out their part of the move. Of course, if the deal 
set has only one member, no arbiter is needed. 

We define a secondary payoff function pay{i, rrt,,D,), 
the set of possible payoffs to / of making move m,, and 
suggesting offer group P, 


poj/(i,w,,P l ) - 


(jf) : d G P, A 

d0,|0, t atlowed„{i, P,) A 
d C 0,]}, if such a d exists; 
{p(t,r7i) : m r G a/fouied,,,^,™,)}, 
l otherwise. 


We designate by aiiuwed rn {i,mi) the set of moves that 
other agents might potentially make while / makes move 
m,, and by allowedo{i,D h ),) the set of offers that other 
agents might make while i suggests offer group D,. Our 
formalism implicitly separates offer groups from moves (in 
other words, there will be no effect on moves by offer 
groups or vice versa). Intuitively, this reflects simulta¬ 
neously revealing one's move and offer group, with one's 
eventual action determined by others' offer groups (that 
is, only if there is no agreement will you have to carry out 
your move). Future work might investigate the situation 
where offers arc made before moves are chosen, and may 
thus affect them. 

For nonempty sets {«,} and {fij}, wc write {a,} < 
{0j} if a, •; fij for all i,j (and say that {ftj } strictly domi¬ 
nate..-: {«;})■ Likewise, we write {«,} < {/?,} for nonempty 
sets {«,} and {/?_,} if a, < for all i,j and the inequal¬ 
ity is strict in at least one case. Wc then say that {/?y} 
dominates {Qj}. 

Finally, we define jj(S, r/.s) as {p(», y) : i £ S A yg G 
Ms }. These arc the possible payoffs to a group S of players 
of making move yg. 

2.1 Rational Moves 

We will denote by // m (p,t) the set of rational moves 
for agent % in game p. We use the following definition 
to constrain what moves arc elements ofthat is, 
what moves arc rational (we will follow the convention that 
free variables are considered universally quantified): 


< pny(*,K,,0) => y, i (1) 

In other words, if, when no binding agreement will be 
reached, every possible payoff to / of making move y t is less 
than every possible payoff to /' of making move x t ,, then y, 
is irrational for i. Of course, this does not imply that X| is 
rational, since better moves may still be available. 
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In general, it will not be possible to fully specify the 
value of pay(i,for all m n since there is not full in¬ 
formation as to the moves that the other agents will make. 
Instead, we use (1) to show that some moves are not ra¬ 
tional. Because the dominance relation is transitive but 
irreflexive (and there are a finite number of moves), it is 
impossible to show that all moves are irrational. 

2.2 Rational Offer Groups 

We define a rational offer group in a way analogous 
to how we defined a rational move above. We denote by 
*) the set of rational offer groups for agent % in game 
p, and characterize a rational offer group by the following 
constraint on members: 

elm j [pay (t, m,, P,) < pay(t,m l ,0,)] P, $ fl 0 (p,i). (2) 

In other words, if for some move m x every possible payoff 
resulting from offer group P t is less than every possible 
payoff resulting from offer group 0,, then P, is not a ra¬ 
tional offer group. 

There is one more constraint on members of Ro(p,i): 
rational offer groups specify (through the function p) a 
continuous range of payoffs that are acceptable to an agent. 
Intuitively, a rational offer group must reflect the notion 
of "monotonic satisfaction"- -if a rational agent is satisfied 
with a particular payoff, he will be satisfied with one of 
equal or greater value (this is a fundamental meaning of 
"utility"). Formally, we write 

[p(i, *) < p(»> *) A r C O,] => a € Oi ( 3 ) 

for all 0, g and moves r and a . For a particular 

game and player, a rational offer group can thus be unam¬ 
biguously specified by any of its members with the lowest 

payofT. 

In general, there may be more than one rational offer 
group for an agent in a game. If full information were 
available to an agent about the offers others were going 
to make (along with their "backup moves"), it would be 
trivial to determine /?„{;>,i). In practice such information 
is not available, but a rational agent i may be able to 
discover some rational olfer group, i.e., some offer group 
provably in R 0 (p,i). 

2.3 Rational Moves and Offer Groups for a Set of 
Players 

We also wish to define the rational moves and the 
rational offer groups available to a set of players. For * *- 
P t we denote by /? m (p,S) the rational moves for the group 
S in the game p. It follows that the members of fl«M) 
arc elements of M s . We assume that 

/up,*’) c: 'Up,s') * for s ' c s - 


This states that no rational move for a set can require irra¬ 
tionality on the part of a subset. An obvious consequence 
of this assumption is that 

■S»n(p,S) C 

A move that is rational for a group of players is thus ra¬ 
tional for each player in the group. 

Similarly, we denote by /?„(p,S) the set of rational 
deals for S in the game p (that is, the members of/2 u (p,S) 
are sets of elements from M P ). It is the "crossproduct- 
intersection" of rational offer groups for the individual 
agents: 

Ro(r,S) = {O : 0 = f) 0, A 0, 6 R 0 {p,j)}. 

>es 

2.4 Rationality Assumptions 

The value of pay(i,m,,yi) will depend, of course, on 
the values of allowed m (i, m,) and allowcd 0 (i, y t ) (i.e., the 
moves and the deals that other agents can make). In order 
to constrain the value of pay, we now define each of the 
allowed functions (allowed m is defined as in [8]). 

1. Minimal move rationality: 

allowed m (i,m t ) = M, Each player assumes that the 
others may be moving randomly. 

2. Separate move rationality: 
a//oiuerf„ 1 (t,m 1 ) C /J m (p,i). Each player assumes the 
others arc moving rationally. 

3. Unique move rationality: 

Fur all m, and rr»', allmi)ed vl {i,m,) - aHowetU*> m i) 
and \alloed m [i, m t )1= 1. Each player assumes that 
the others* moves are fixed in advance. This may be 
combined with separate rationality. 

The assumptions above do not fully specify what is 
or is not a rational move. Rather, they help constrain the 
set of rational moves by allowing us to prove that certain 
moves are not rational. We* now define analogous assump¬ 
tions regarding deals other agents might be making: 

1. Minimal deal rationality: 

ol{otfiet<, > (i,v 1 ) C P(Mp), where P{Mr) denotes the 
power set of Mp. Each player assumes that the others 
may be making random deals. 

2. Separate deal rationality: 

aUoiutd„(i t y,) C Each player assumes that 

the others are making rational deals. 

3. Unique deal rationality: 

For all Di and /£;, Di\ = allowdji, Ei) 

and allowed,,— 1. Each player assumes that 
the others' offers arc fixed in advance. This may be 
combined with separate deal rationality. 
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We refer to the combination of separate and unique 
move rationality as individual move rationality, and to the 
combination of separate and unique deal rationality as in¬ 
dividual deal rationality. As in [8], any move that can be 
proven irrational under the assumption of minimal move 
rationality will be similarly irrational under the other move 
rationality assumptions. Analogously, any offer group that 
can be proven irrational under the assumption of minimal 
deal rationality will be irrational under the other deal ra¬ 
tionality assumptions. 

§3. Rational Deal Characteristics 

With our notational conventions defined, we can now 
prove several characteristics of henceforth use 

S to denote any move that gives agent i his highest payoff. 

Theorem 1 (Existence of a non-null rational offer 
group). |R„(p,*)| > 1. 

Proof. If ft„(p,t) were empty then i would do best by 
making no offers and relying on his move to generate his 
payoff. But pay{i t m,,s) will be greater than or equal to 
pay(t,m,,0) for all m t (since a will either be matched by 
other agents, increasing i's payoff, or will not be matched, 
and will therefore be harmless since it doesn't affect other's 
moves). Thus the offer group {*} would also be in 
guaranteeing it to have at least one non-null member. 

It follows directly from the definition of a rational offer 
group (3) that all non-empty members of i's set of rational 
offer groups include a Together with Theorem 1, this 
implies that it is always rational for an agent to include in 
his offer group the move that gives him his highest payoff. 

In addition, an agent can often restrict his offers to 
those whose payoffs arc higher than that which he can get 
by making the null offer, relying on his move to give him 
this payofT. 

Theorem 2 (Lower bound). Assuming unique deal ra¬ 
tionality, if for any move m, and joint move y / a, 

aO.lOi € A y $ O,]. 

Proof. There arc two cases: 

1. p(t, y) < pay(t,T7»j,0): The only way for y to be in 
some rational olTcr group P, is for the y deal not to 
be accepted (otherwise pay^m,, Pi) would be dom¬ 
inated by the offer group pcij/(t,m,,0,) where Oi = 
{z : p(t, x ) > p(t, y)}). But if y is not accepted, then 
it is equivalent to another offer group that includes 
only those moves with payoffs higher than y. This 
smaller offer group will then also be in 


2. p(i, y) = pat/(t,m u 0): Assume that y is in some 
rational ofTer group P,. If y is not accepted, or is ac¬ 
cepted along with other offers, then puy(i,m^,0,) > 
pay{i,m„P,) where 0, = {£ : p(t,i) > p{i,y)}, 
so there is another rational offer group without y, 
namely the offer group 0,. If y is the only accepted 
offer, then poy(i,m,-, s') — poy(i,m,,P,) (where a is 
the move that gives i his highest payoff), since s will 
not be accepted anyway and therefore «■»,,«*) = 
pay(t,m,,0). Again, there is a rational offer group 
that does not include y. □ 

Note that Theorem 2 will not hold for S (i.e., the joint 
move that gives / his highest payoff) since that would con¬ 
tradict Theorem 1 (Theorem 2's proof makes implicit use 
of the fact that y ^ s' in its construction of the dominating 
offer group O t ). Note also that Theorem 2 will not hold 
under minimal deal rationality. Imagine that a perverse 
opponent chooses his ofTer group as follows: 

1. If you include in your offer group deals with low payoff 
(for you), he will accept the deal with your best payofT; 

2. If you don't offer that low deal he will accept no deals 
and you will have to rely on your move to get a payoff. 

Under these circumstances (fully consistent with minimal 
deal rationality), it might be to your advantage to offer a 
low-payoff deal, "since that might be the only way to get 
your maximal payoff. 

3.1 Restricted Case Analysis 

The consequences of Theorem 2 will differ, of course, 
based on assumptions about allowed rr i since these will af¬ 
fect pay(i,m,,0) for any given m t . Consider the following 
payoff matrix: 



c 

a 

a 

1\4 

0\5 

T 

3\2 

2\7 


It is shown in [8] that, assuming minimal move ratio¬ 
nality (potentially random or even malevolent moves by 
other agents), the row agent can still use "restricted case 
analysis" to constrain his move to b. If unique deal ratio¬ 
nality can be assumed then the offer group consisting solely 
of move b\c (i.e., bottom left corner) is guaranteed by The¬ 
orems 1 and 2 to be a rational offer group. Of course, there 
may be other rational offer groups, for example the offer 
{b\d,b\c}, depending on what deals the other player can 
offer. 

Wc formalize part of the above discussion: 

Corollary 3 (Restricted case analysis). Assuming 
minimal move rationality and unique deal rationality, if for 
some x, and y t) for all x, and y„ 

p(*,v) <?(»,*)• 
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then there exists an 0, £ /?„(p,s) such that no y is in 0,- 
Proof. Follows from Lemma 3 in [8] and Theorem 2. □ 

3.2 Case Analysis and Iterated Case Analysis 

There are restrictions on rational offer groups anal¬ 
ogous to Corollary 3 that apply for case analysis and it¬ 
erated case analysis under the assumptions of unique and 
individual move rationality, respectively. The case analy¬ 
sis situation is represented in the following payoff matrix, 
seen from the row player's perspective: 



c 

d 

a 

4\1 

2 

b ; 

3\5 

0\1 


The row player need only assume that the column 
player's move will not be affected by its own move (i.e., 
unique move rationality) to realize that making move a is 
in all circumstances superior to making move 6. As long 
as unique deal rationality can also be assumed, there is 
a guaranteed rational offer group consisting only of move 
a\c. 

Corollary 4 (Case analysis). Assuming unique move 
rationality and unique deal rationality, if [or some x, and 
y, [or all X{ and yi with X\ ~ yf, 

p(t,v) < p(»>£), 

then there exists an 0, £ such that no y is in O t 

Proof. Follows from Lemma 4 in [8] and Theorem 2. a 

Similarly, if the column player can assume that the 
row player is rational and making moves independent of the 
column player's moves (i.e., individual move rationality), 
then he can prove that move d is optimal in the above 
matrix (since the row player will play a). With unique 
deal rationality, he has a guaranteed rational offer group 
of {a\d,b,\c} (the offer group {b\c} is also rational). 

The effect of Theorems 1 and 2 is to show us that there 
is always a rational offer group that includes an agent's 
highest payoff outcome, and includes no outcomes below 
or equal to what he could achieve without deals. Below, 
we consider other constraints on an agent's rational offer 
groups. 

§4. The Group Rationality Theorem 

The work in [8] and [9] was concerned with the formal¬ 
ization of cooperative behavior, given certain constraints 
about the agents participating in an interaction. Using our 
notation, a desirable general result would have been 

(4)P,y ,0) < pay(P, x ,0) => y $ R, n {p,P), 


that is, if any joint move for all players is dominated by 
any other, then the dominated joint move is not rational 
for them. This result could not be proven, and the inabil¬ 
ity to do so stemmed directly from the lack of communi¬ 
cation inherent in the model. Without at least minimal 
communication (e.g., sell-identification), there is no way 
to coordinate on a universally perceived best move when 
several such moves exist. 

We are now able to derive an important result about 
R 0 (p, P) very similar to the elusive non-communication re¬ 
sult in (4). 

Theorem 5 (Group offers). Assuming individual deal 
rationality, 

p(r, v) < p(P,i) 30, |0, £ Il u (p,i) A y $ O,] 
for ail i £ P. 

Proof. There are two possible cases: 

1. VOj[O, £ R„(p,t) ~> y </, Oj j: Since y will not be 
a consummated deal, if P, is any offer group con¬ 
taining y then p«y(»,m,,0,) > pay(i,m„ P,) whore 
0, = {£ : p(t,x) > p(*,>/)}. Along with Theorem 
1, this shows the existence of a non-null rational offer 
group without y . 

2. 3Oj\0j £ ft„(p,t) A y £ 0 j]: All other .agents are 

rational (by assumption), and any rational offer group 
that includes y also includes x (3); thus, if P, is 
any olTer group containing y, then m,,0,) > 

ptty{i,m u P,) where O, ~ {x : p(i, x) > p(t,y)}. 
Tliis, along with Theorem 1, shows the existence of 
a rational offer group without y , o 

Because of Theorem 5, a rational agent interacting 
with other rational agents knows that he need not offer 
a move that is dominated for all players—doing so can¬ 
not increase his payoff. If the other rational agents also 
know that all agents arc rational, they too will realize that 
they can refrain from offering a move that is dominated for 
all players. Higher levels of knowledge [13], such as their 
knowing that all agents know that all agents arc rational, 
are not needed. In addition, because of the definition of 
rational offer groups (3), the agents can refrain from of¬ 
fering any moves with smaller payoffs, since those groups 
would necessarily include the dominated move. 

§5. Examples 

We will now examine the consequences of our rational 
offer theorems in several additional types of games. 

5.1 Best Plan 

The best plan scenario is reflected in the following 
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matrix: 


c 

a 7 
b 5 


4 

6 


All agents recognize that there is a single best move; how 
will their offer groups reflect this? From Theorem 1, a ra¬ 
tional agent knows that he can safely offer the move that 
gives him his best payoff (i.e., move a\c), even assum¬ 
ing minimal deal rationality on the part of other players 
(though the theorem is noncommittal as to whether other 
moves can or should be included with it). All players can 
also rule out move aid using Theorem 2 if unique deal ra¬ 
tionality holds (since aid yields the lowest payoff). If there 
is an assumption of individual deal rationality, Theorem 5 
can guarantee each agent that the offer group consisting 
solely of ale is rational. Communication thus allows coor¬ 
dination on the best plan under more intuitive assumptions 
about the interaction than those used in [8], 

5.2 Breaking Symmetries — Multiple Best Plan 

Our rational offer group theorems allow us to solve 
the "Multiple Best Plans" case that could not be solved in 
[8], The following matrix illustrates the scenario: 



c 

d 

a 

-1. 

2 

b 

2 

■1 J 


case analysis implies that it is always better to play d; 
both players choosing d, however, is less desirable for both 
than if they had chosen c. The dilemma has received much 
attention within the philosophy and game theory literature 
[2, 5, 22, 27], In the usual presentation of the prisoner's 
dilemma, playing c is called "cooperating," and playing dis 
called "defecting." With the presence of binding promises, 
in fact, there is no dilemma: 

Corollary 6 (Prisoner's Dilemma). If all players know 
that all players are operating under the assumption of in¬ 
dividual deal rationality, agents will cooperate in the pris¬ 
oner's dilemma. 

Proof. The first player knows that it is rational to of¬ 
fer die (since it is rational even under minimal rational¬ 
ity, Theorem 1); he also knows it is irrational to offer eld 
(from Theorem 2, since individual deal rationality includes 
unique deal rationality). By Theorem 5, there is a ratio¬ 
nal offer group without did. Now he knows that the other 
agent will not offer die (since the other agent is assumed 
rational and operating under the assumption of unique deal 
rationality, Theorem 2). Since die will certainly not be 
met, pay(i,d, {d\c}) < pay(i,d,{d\c,c\c}). Thus, the of¬ 
fer group {d\c,c\c} is rational. The second agent will, if 
rational and working under the same assumptions, come 
to the same conclusion. The deal clc will be struck, and 
the agents avoid the did trap. □ 

§6. Extending the Model 


Assuming minimal deal rationality, an agent can rationally 
offer b\c and aid. In addition, assuming unique deal ra¬ 
tionality an agent knows that he can rationally not offer 
ale and bid (since they arc lowest yield moves). This anal¬ 
ysis can be done by both agents if they are rational and 
operating under the unique deal assumption. Their offer 
sets will overlap on the multiple best outcomes; selection 
of a single alternative from the multiple agreements then 
occurs. 

5.3 Prisoner's Dilemma 

The prisoner's dilemma is represented by the following 
matrix (we choose different names for our moves so as to 
conform to the literature): 



c 

d 

c 

3 

0\5 

d 

5\0 

1 


Each agent most desires to play d while the opponent plays 
c, then to play c along with the opponent, then to play d 
along with the opponent, and least of all to play c while 
the opponent plays d. The dilemma comes about because 


For certain types of interactions, the model presented 
above (i.e., the various assumptions and theorems about 
rational moves and deals) does not specify rational activity 
in sufficient detail. We can extend the model in a variety of 
ways to handle these cases, and at the same time capture 
a wider range of assumptions about the interaction. In 
this section, we briefly present some of the extensions that 
might be made to our original model. 

6.1 Similar bargainers 

Consider the following payoff matrix (equivalent to 
game 77 in Rapoport and Guyer's Lixonomy [23]) 



c 

d 

a 

3 

2 

b 

5\0 

0\5 


Assuming separate deal rationality, the first player 
can assume that b\c should be in a rational offer group 
of his, and that bid should not be. What else can be 
said about what constitutes a rational offer group in this 
game? There arc three choices, namely {b\c}, {a\c,b\c}, 
and {a\d,a\c,b\c}. In order to decide among the choices, 
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we would like to make more assumptions about the "bar¬ 
gaining tendencies" of the other agent (since, in fact, some 
agents might be tougher deal-makers than others). We 
will ignore what value the agents might place on making a 
particular move in the absence of a deal, since the payoff 
is underdetermined. 

Let us define two offer groups O, and 0 3 to be similar 
if and only if they both have the same lower boundary for 
what deals are included or not included. It is true that 
similar (OM if and only if 

^|p(*» y ) > n o V 6 O, A p(j, X ) > n O x € Oj] 

for some number n. If we use the similar bargainers defi¬ 
nition, wc implicitly assume some meaningful measure for 
comparing inter-personal utility. 

One assumption to use in deciding upon rational offer 
groups is now that the other agent will accept deals that 
you would accept; that is, 0 } € t>0, G 

where similar[O lt Oj). 

Under this assumption, we can decide what deal is ra¬ 
tional in the above game. Player 1 reasons that if he offers 
{b\c}, player 2 (who is a similar bargainer) will offer only 
{b\d}. There will be no match. In the same way, if it would 
be rational for player 1 to offer {a\c, b\c} then player 2 will 
offer {a\c,b\d}, with an agreement on ate and a payoff of 
{3} for both. If player 1 offers {a\d,a\c,b\c} then player 
2 will offer {a\d,a\c,6\ci} and there will be agreement on 
a\d and on ale, with a payoff of {2,3} for both. Since {3} 
dominates {2,3}, agents who assume common knowledge 
[13] of the similar bargainer assumption should choose the 
rational offer group that yields agreement on ate. 

6.2 Stochastic Model — The Game of Chicken 

Note, however, the following payoff matrix (commonly 
known as the game of chicken [23]): 



c 

d 

a 

3 

2\5 

TT 


1 


Two agents, even if they assume individual deal ratio¬ 
nality and the similar bargainers assumption, will be faced 
with the following choices: a payoff of {3} or a payoff of 
{2,3,5}. According to our definitions, neither of these sets 
dominates the other. 

If, however, we extend the model to include a prob¬ 
abilistic choice from within the agreement set, it is clear 
that the latter agreement set dominates the former (with 
an expected value of 3.33 versus 3). A further stochastic 
extension to our model would allow moves themselves to 
be specified probabilistically (e.g., a with probability .5, 
and b with probability .5). In the game theory literature, 


this is the distinction between pure strategics and mixed 
strategics [18], An analysis of this model is beyond the 
scope of the present discussion. 

6.3 Conjunctive Offers — Battle of the Sexes 

In the game of chicken example presented above, there 
was an added complexity that was temporarily ignored: 
the possibility of "defection." If one agent reasons that 
the other agent will accept all payoffs above 2, it is to 
the first agent's benefit to only offer moves of payoff 5 
(this is analogous to the prisoner's dilemma, with the same 
potential that both players will use identical reasoning and 
no agreement will be reached). A similar problem can be 
seen in the so-called battle of the sexes matrix, seen below. 



c 

! d 

a 

-1 

I\2 

T 

2\1 

-1 


One approach to solving this problem is to allow "com¬ 
posite" offers, for example, an offer consisting of a conjunct 
of several moves (the conjunct must be matched exactly in 
order for a deal to occur). Thus, the offer consisting of 
a\dAb\c can consistently be made by both agents without 
the potential of defection (and with an expected utility of 
1.5 for each). This notion can be extended to general log¬ 
ical offers consisting of disjuncts, conjuncts and negations 
ofjoint moves. The battle of the sexes can thus be uniquely 
solved with the assumption of similarity in bargaining, if 
conjunctive offers are allowed. 

§7. Previous Work 

The subject of interacting rational agents has been 
addressed within the field of artificial intelligence as well 
as in the discipline of game theory. Here we will briefly 
review relevant contributions from these two areas, and 
contrast our present approach with previous efforts. 

7.1 Work in Artificial Intelligence 

As mentioned above, researchers in distributed artifi¬ 
cial intelligence have begun to address the issues arising in 
multi-agent interactions. Lesser and Corkill [4] have per¬ 
formed empirical studies to determine cooperation strate¬ 
gies with positive characteristics (such as, for example, 
what types of data should be shared among distributed 
processors). They are solely concerned with groups of 
agents who share a common goal, but have acknowledged 
the benefit even under this assumption of having agents 
demonstrate "skepticism" (i.e., not being distracted by 
others' information). 

Georgcff [10,11] has developed a formal model to com¬ 
bine separate plans of independent agents. The primary 
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concern is to avoid destructive interference caused by si¬ 
multaneous access to a shared resource. The model used 
assumes that the agents have separate goals, but that these 
goals do not directly oppose one another. Cooperative ac¬ 
tion is neither required nor exploited, except insofar as it 
allows agents to keep out of each other's way. 

Other notable efforts include Smith's work on the con¬ 
tract net [7], Malone's work extending the contract net 
model using economic theory [19], and the theoretical work 
on knowledge and belief of carried out by Appelt, Moore, 
Konolige, Halpern and Moses [1, 14, 15, 16, 17, 20, 21], 

The current work extends these previous models of in¬ 
teraction by allowing a fuller range of goal disagreements 
among agents. By using a framework that captures to¬ 
tal and partial goal conflicts, it allows investigation into 
compromise, promises and cooperative action. 

This paper considers the communication scenario in 
ways similar to the manner in which previous work [8, 9] 
investigated cooperation among rational agents when no 
communication occurs. Below we briefly note the advan¬ 
tages that were gained when communication and promises 
were added to the interaction model. 

The best plan interaction was handled in our frame¬ 
work by assuming individual deal rationality. Because 
in the no-communication case this scenario could not be 
solved using individual move rationality, other assump¬ 
tions were introduced: informed rationality in [8] and com¬ 
mon rationality in [9], Informed rationality constrained 
allowcd m in a way that assumed each player would re¬ 
spond in a rational way to the others' moves, whatever 
they might be. 

It should be noted in passing that an assumption of 
common knowledge of rationality will also allow for a solu¬ 
tion to the best plan case, though this has not previously 
been pursued in the literature. 

To solve the prisoner's dilemma, even more assump¬ 
tions had to be introduced. The interested reader is re¬ 
ferred to |8] and [9] for full details; see also [25], 

Even using a variety of assumptions, previous work 
could not handle the multiple best plan case, where there 
arc several outcomes all equally recognized as best by all 
players. To break the symmetry, some communication 
is needed, though this communication can be as simple 
as self-identification and reliance on a common rule (e.g., 
agent with lowest name performs lowest ordered action). 

7.2 Game Theory 

Game theory has focused on a variety of interactions, 
and sought to characterize the types of actions that ratio¬ 
nal agents will take in each. Many of the same questions 
that come up in our work have been addressed by game 
theoreticians. Their approach, however, has left some im¬ 


portant issues unexamined. Consider the following quote 
from the classic game theory text, [18]: 

Though it is not apparent from some writings, 
the term "rational" is far from precise, and it 
certainly means different things in the different 
theories that have been developed. Loosely, it 
seems to include any assumption one makes about 
the players maximizing something, and any about 
complete knowledge on the part of the player in a 
very complex situation... [Games and Decisions, 
p. 5] 

As another example, consider the following best plan 
interaction: 



a 2 

b 2 


4 

1/2 

Bi 

3/1 

2/3 


It was demonstrated above that the best plan case can 
only be solved under particular definitions of rationality. 
Rapoport and Guyer, however, writing in [23], put for¬ 
ward the following assumption regarding agents' behavior 
(citing the similarity with [26]): 

{A 3 ). If a game has a single Pareto equilibrium, 
the players will choose the strategy which con¬ 
tains it... 

Our assumption (A3) says that A,A 2 is the natu¬ 
ral outcome, which, of course, is dictated by com¬ 
mon sense... we shall refer to this as a prominent 
solution. [A Taxonomy of 2 X 2 Games] 

In short, game theory has sometimes been willing to 
take for granted certain types of behavior without carefully 
formalizing its definitions of rationality, or its assumptions 
of inter-agent knowledge. 

These questions are particularly important in the field 
of artificial intelligence. We arc not interested in charac¬ 
terizing game matrices: we want to characterize agent ra¬ 
tionality and explore the consequences of various assump¬ 
tions. The goal is to be able to implement intelligent agents 
whose strategies of behavior will be provably rational. 

§8. Conclusion 

Intelligent agents will inevitably need to interact flexi¬ 
bly in real world domains. Previous work has not modeled 
the full range and complexity of agents' varied goals. The 
benevolent agent assumption, which assumes that agents 
have identical or non-conflicting goals, has permeated pre¬ 
vious approaches to distributed Al. 

This paper has presented a framework for interaction 
that explicitly accounts for communication and promises, 
and allows multiple goals among agents. The model pro¬ 
vides a unified solution to a wide range of problems, in¬ 
cluding the types of interactions discussed in [8] and [9], 
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Through the use of communication and binding promises, 
agents are able to coordinate their actions more effectively, 
and handle interactions that were previously problemati¬ 
cal. By extending the communication model even further, 
a wider variety of interactions can be handled. 
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